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1. (Amended) A method of extracting relevant data« comprising: 

accessing at least a first set of data of a first document, the first document including markup 
language, wherein the first set of data includes selected data of the first document, the selected data 
at least partly sp^ifying document data; 

accessing at least a second set of data of a second document, the second document including 
markup language; 



determining an edit sequence for identifying and matching a similarity between at least part 
of the first ^et of data and at least part of the second set of data, the edit sequence including any of 
insertions, deletions, and substitutions; and 

finding corresponding data of the second set of dat a to provide releva nt data that may be 
extracted , the corresponding data having a correspondence to the selected data, the correspondence 
at least partly found by determining the edit sequence. 

2, (Original) The method of claim 1, wherein the edit sequence includes none of insertions, 
deletions, and substitutions. 

3, (Original) The method of claim 1. wherein the edit sequence includes at least one of one or 
more insertions, one or more deletions, and one or more substitutions. 

4, (Original) The method of claim 1, wherein the edit sequence is at least partly determined by 
calculating a total cost, and each of one or more of insertions, deletions, substitutions, and matches is 
associated with one or more costs. 
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5. (Amended) The method of claim 4, wherein the one or more costs arc at least partly set in^ 
manner to facilitate [cncouraEel the edit sequence to include one or more matches between at least 
$ome markup language from the selected data of the first document and at least some markup 
language from the second document, the markup language including text-based content and tags. 

6. (Amended) The method of claim 4, wherein a first cost is associated with a first match at a 
fu*st distance from a root of a tree representation of some set of data, a second cost is associated with 
a second match at a second distance from a root of a tree representation of some set of data, the first 
distance is less than the second distance, and the first cost and the second cost are set in a manner to 
facilitate [encouraKe] the first match more than the second match. 



7. (Original) The method of claim 4» wherein a first cost is associated with u first insertion at a 
first distance from a root of a tree representation of some set of data, a second cost is associated with 
a second insertion at a second distance from a root of a tree representation of some set of data» the 
first distance is less than the second distance, and the first cost and the second cost are different. 

8. (Original) The method of claim 4, wherein a first cost is associated with a first deletion at a 
first distance from a root of a tree represenialion of some set of data, a second cost is associated with 
a second deletion at a second distance from a root of a tree representation of some set of data, the 
first distance is less than the second distance, and the first cost and the second cost arc different. 

9. (Original) The method of claim 4> wherein a first cost is associated with a first substitution at 
a first distance from a root of a tree representation of some set of data, a second cost is associated 
with a second substitution at a second distance from a root of a tree representation of some set of 
data, the first distance is less than the second distance, and the first cost and the second cost are 
different. 
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1 0. (AjTicndcd) The method of claim 4* wherein a first cost is associated with a first text-based 
content substitution such that a first length of substituting text-based content is substantially equal to 
a first length of substituted text-based content, a second cost is associated with a second text-based 
content substitution such that a second length of substituting tcxt-basod content is substantially 
dilTcrcnt from a second length of substituted text-based content, and the first cost and the second 
cost are set in a manner to inhibit fdiscouraae] the second text-based content substitution more than 
the first text-based content substitution, 

1 1 . (Amended) The method of claim 4, wherein markup language includes at least leKt-baseci 
content and tags, and tlic one or more costs are at least partly set a m anncf to jph' bit f discourage] 
substitutions of text-based content for one or more tags. 



1 2. ( Amended) The method of claim 4, wherein markup language includes at least tcxt^bascd 
content and tags, and the one or more costs are at least partly set in a manner to inhibit [discourage] 
substitutions of one or more tags for text-based content. 

U. (Amended) The method of claim 4, wherein a first cost is associated with preserving a firsi 
tag with unchanged attributes, a second cost is associated with preserving a second lag with one or 
more changed attributes, and the first cost and the second cost are set jn ^ jnaiiincr to inhibjt 
[discourage] preserving the second tag more than preserving the first tag. 

14. (Original) The method of claim 1, wherein document data is at least partly from the first 
document 

1 5. (Original) The method of claim 1, wherein document data is at least partly from the second 
document 

16. (Original) Tlie method of claim 1, wherein the second document is received if the second 
document is different from the first document. 
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1 7. (Original) The method of claim 1 » wherein the markup language includes at least HTML 
(Hypertext Markup Language). 




1 8. (Original) The method of claim 1 , wherein the markup language includes at least one of 
XML, a subset of XML, and a specialization of XML (extensible Markup Language), 



19. (Original) The method of claim 1 wherein the markup language includes at least WML 
(Wireless Markup Language). 



20, (Original) The method of claim 1 , wherein the martcup language includes at least one of 
SGML, a subset of SGML, and a specialization of SGML (Standard Generalized Markup Language). 
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